Cumulative distribution networks: Inference, estimation and applications of graphical models for cumulative distribution functions
نویسندگان
چکیده
Cumulative distribution networks: Inference, estimation and applications of graphical models for cumulative distribution functions Jim C. Huang Doctor of Philosophy Graduate Department of Electrical and Computer Engineering University of Toronto 2009 This thesis presents a class of graphical models for directly representing the joint cumulative distribution function (CDF) of many random variables, called cumulative distribution networks (CDNs). Unlike graphical models for probability density and mass functions, in a CDN, the marginal probabilities for any subset of variables are obtained by computing limits of functions in the model. We will show that the conditional independence properties in a CDN are distinct from the conditional independence properties of directed, undirected and factor graph models, but include the conditional independence properties of bidirected graphical models. As a result, CDNs are a parameterization for bidirected models that allows us to represent complex statistical dependence relationships between observable variables. We will provide a method for constructing a factor graph model with additional latent variables for which graph separation of variables in the corresponding CDN implies conditional independence of the separated variables in both the CDN and in the factor graph with the latent variables marginalized out. This will then allow us to construct multivariate extreme value distributions for which both a CDN and a corresponding factor graph representation exist. In order to perform inference in such graphs, we describe the ‘derivative-sum-product’ (DSP) message-passing algorithm where messages correspond to derivatives of the joint ii cumulative distribution function. We will then apply CDNs to the problem of learning to rank, or estimating parametric models for ranking, where CDNs provide a natural means with which to model multivariate probabilities over ordinal variables such as pairwise preferences. We will show that many previous probability models for rank data, such as the Bradley-Terry and Plackett-Luce models, can be viewed as particular types of CDN. Applications of CDNs will be described for the problems of ranking players in multiplayer team-based games, document retrieval and discovering regulatory sequences in computational biology using the above methods for inference and estimation of CDNs.
منابع مشابه
Exact inference and learning for cumulative distribution functions on loopy graphs
Abstract Many problem domains including climatology and epidemiology require models that can capture both heavy-tailed statistics and local dependencies. Specifying such distributions using graphical models for probability density functions (PDFs) generally lead to intractable inference and learning. Cumulative distribution networks (CDNs) provide a means to tractably specify multivariate heavy...
متن کاملBeta-Linear Failure Rate Distribution and its Applications
We introduce in this paper a new four-parameter generalized version of the linear failure rate distribution which is called Beta-linear failure rate distribution. The new distribution is quite flexible and can be used effectively in modeling survival data and reliability problems. It can have a constant, decreasing, increasing and bathtub-shaped failure rate function depending on its parameter...
متن کاملCumulative Distribution Networks and the Derivative-sum-product Algorithm: Models and Inference for Cumulative Distribution Functions on Graphs
We present a class of graphical models for directly representing the joint cumulative distribution function (CDF) of many random variables, called cumulative distribution networks (CDNs). Unlike graphs for probability density and mass functions, in a CDN, the marginal probabilities for any subset of variables are obtained by computing limits of functions in the model. We will show that the cond...
متن کاملA hybrid method to find cumulative distribution function of completion time of GERT networks
This paper proposes a hybrid method to find cumulative distribution function (CDF) of completion time of GERT-type networks (GTN) which have no loop and have only exclusive-or nodes. Proposed method is cre-ated by combining an analytical transformation with Gaussian quadrature formula. Also the combined crude Monte Carlo simulation and combined conditional Monte Carlo simulation are developed a...
متن کاملEfficient Estimation of the Density and Cumulative Distribution Function of the Generalized Rayleigh Distribution
The uniformly minimum variance unbiased (UMVU), maximum likelihood, percentile (PC), least squares (LS) and weighted least squares (WLS) estimators of the probability density function (pdf) and cumulative distribution function are derived for the generalized Rayleigh distribution. This model can be used quite effectively in modelling strength data and also modeling general lifetime data. It has...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009